A New Hierarchy of Phylogenetic Models Consistent with Heterogeneous Substitution Rates

نویسندگان

  • Michael D. Woodhams
  • Jesús Fernández-Sánchez
  • Jeremy G. Sumner
چکیده

When the process underlying DNA substitutions varies across evolutionary history, some standard Markov models underlying phylogenetic methods are mathematically inconsistent. The most prominent example is the general time-reversible model (GTR) together with some, but not all, of its submodels. To rectify this deficiency, nonhomogeneous Lie Markov models have been identified as the class of models that are consistent in the face of a changing process of DNA substitutions regardless of taxon sampling. Some well-known models in popular use are within this class, but are either overly simplistic (e.g., the Kimura two-parameter model) or overly complex (the general Markov model). On a diverse set of biological data sets, we test a hierarchy of Lie Markov models spanning the full range of parameter richness. Compared against the benchmark of the ever-popular GTR model, we find that as a whole the Lie Markov models perform well, with the best performing models having 8-10 parameters and the ability to recognize the distinction between purines and pyrimidines.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modelling the Heterogeneous Price Setting Behavior of Firms (DSGE Approach)

Despite the empirical evidence of the difference in the degree of price stickiness of goods and services, in the new standard Keynesian models, the same price stickiness is considered for all firms producing intermediate goods. In recent years, a new generation of pricing models has been introduced to simulate the heterogeneous price setting behavior in which, unlike standard pricing models, th...

متن کامل

Bayesian Selection of Nucleotide Substitution Models and Their Site Assignments

Probabilistic inference of a phylogenetic tree from molecular sequence data is predicated on a substitution model describing the relative rates of change between character states along the tree for each site in the multiple sequence alignment. Commonly, one assumes that the substitution model is homogeneous across sites within large partitions of the alignment, assigns these partitions a priori...

متن کامل

Maximum likelihood estimation of phylogenetic trees is consistent when substitution rates vary according to the invariable sites plus gamma distribution.

Maximum likelihood estimation of phylogenetic trees from nucleotide sequences is completely consistent when nucleotide substitution is governed by the general time reversible (GTR) model with rates that vary over sites according to the invariable sites plus gamma (I + gamma) distribution.

متن کامل

A model for employer’s decision for recruitment

This paper presents a theoretical and empirical analysis of this decision and how it relates to wage setting and the provision of general training. The theoretical framework is a promotion tournament involving M competing firms with heterogeneous productivities, two-level job hierarchies, and a fixed number of managerial positions. This paper also considers an alternative model based on variati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 64  شماره 

صفحات  -

تاریخ انتشار 2015